MULTIPLE RECURRENCE AND CONVERGENCE FOR 
SEQUENCES RELATED TO THE PRIME NUMBERS 



NIKOS FR ANTZIKIN AKIS , BERNARD HOST, AND BRYNA KRA 

Abstract. For any measure preserving system (X, X, fj,, T) and A €E X 
with p,(A) > 0, we show that there exist infinitely many primes p such 
that n(A n T^~ l) A n T- 2(p_1) A) > (the same holds with p - 1 re- 
placed by p + 1). Furthermore, we show the existence of the limit in 
L 2 (n) of the associated ergodic average over the primes. A key ingredi- 
ent is a recent result of Green and Tao on the von Mangoldt function. 
A combinatorial consequence is that every subset of the integers with 
positive upper density contains an arithmetic progression of length three 
and common difference of the form p — 1 (or p + 1) for some prime p. 



1.1. Results about the primes. The von Mangoldt function A: Z — > R, 
defined by 



plays a major role in understanding the distribution of the prime numbers. 
The classical circle method is the study of the Fourier transform of A re- 
stricted to an interval [l,iV], that is, evaluating the correlation of A with 
complex exponentials on this interval. 

Green and Tao (|12j. |13j . |14j ) generalize this method by estimating the 
correlations of A with "nilsequences" and then deduce an estimate for the 
third Gowers norm of a modification of A (a precise definition of this norm 
is given in Section [2J). This result is the base of our study; we need some 
notation to state it explicitly. 

Let P be the set of prime numbers. If w is a positive integer and r E Z, 
setting 
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I. Introduction 




if n = p m for some m £ N and p € P 
otherwise 



w = n p> 
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for n G N we define 



KAn) = ^P--MWn + r) 



where (/> is the Euler function. It is easier to work with this modified von 
Mangoldt function, as it is better distributed over congruence classes than 
the original function. 

If N is a prime number we identify [0, iV — 1] with Z/NZ in the natural 
way and consider the restriction A 1 „ i jv ]r (n) of K WtT {n) to this interval as 
a function on Z/iVZ. It follows immediately from the results in ^1] that 
(notation explained in Section EJ): 

Theorem 1 (Green and Tao 14 ). For r G Z with (r, W) = 1 we have 

(A W) JV,r(n) - l) • l[0,[A73l)( n ) = OA^oo;«>(l) + O^^l). 

We use this result to derive several results in ergodic theory, on recurrence 
and convergence properties of some sequences related to the primes, and use 
them to deduce a combinatorial consequence. 

1.2. Multiple recurrence along primes. The connection between addi- 
tive combinatorics and ergodic theory began with Szemeredi's celebrated 
theorem |2U| . which states that a subset of the integers with positive upper 
density contains arbitrarily long arithmetic progressions. Furstenberg [H] 
found an equivalent formulation of Szemeredi's Theorem in terms of multi- 
ple recurrence. He proved this "correspondence principle" and showed that 
if (X, X, /i, T) is a measure preserving system 1 , A G X has positive measure, 
and k is a positive integer, then there are infinitely many positive integers 
n such that 

(i) /i(inr n Anr 2n in...nr te A)>o. 

A natural question is what restrictions can be placed on n such that 
the measure of the intersection in Q remains positive. This motivates the 
following definition: 

Definition 1. Given an integer k > 1, S C N is a set of k-recurrence if for 
every measure preserving system (X, X , n,T) and A G X with fJ-(A) > 0, 
there exist infinitely many n G S such that 

n(A n T~ n A n T~ 2n n • • • n T~ kn A) > o . 

Via Furstenberg's correspondence principle, the equivalent combinatorial 
formulation of this definition is classical: 



A measure preserving system is a quadruple (X, X, fx, T), where (X, X, /i) is a proba- 
bility space and T : X — > X is a measurable map such that fi(T~ 1 A) = /-i(A) for all A £ X. 
Without loss of generality we can assume that the probability space is Lebesgue. 
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Proposition. Let k > 1 be an integer. The set S C N is a set of k- 

recurrence if and only if every subset A of integers with positive upper density 
contains at least one arithmetic progression of length k + 1 and common 
difference in S. 

A set of fc-recurrence is also known in the number theory literature as a 
k-intersective set. 

There are many known examples of sets of 1-recurrence. For example, 
one can take all multiples of a fixed number or S — S for any infinite set 
ScN. Furstenberg |Hj and Sarkozy JHj independently showed that values 
of a polynomial, meaning to {q(n) : n £ N} where q(n) is an integer valued 
polynomial with q(0) = 0, form a set of 1-recurrence. Another interesting 
example comes from the primes. Sarkozy showed that the set of shifted 
primes P — 1 (and the set P + 1) form a set of 1-recurrence. Wierdl |21| 
reproved this result using methods from ergodic theory. 

For higher k, restricting the exponents in (JIJ is more difficult. It is an 
immediate consequence of Szemeredi's theorem that one can take the set of 
all multiples of a fixed nonzero integer. Bergelson and Leibman showed 
that polynomial values {q(n) : n £ N} where q(n) is an integer valued poly- 
nomial with q(0) = are sets of ^-recurrence for all A; > 1. On the other 
hand, Furstenberg 9 constructed an infinite set S whose set of differences 
S — S is not a set of 2-recurrence. For general k, a set of fc-recurrence but 
not (k + l)-recurrence was constructed in j7]. 

The recent results of Green and Tao on the von Mangoldt function allow 
us to show 2-recurrence for the primes: 

Theorem 2. Let (X, X, fj,, T) be a measure preserving system and let A G X 
have positive measure. There exist infinitely many p £ P such that 

n[A n t-(p-^a n t-^p-^a) > 

and there exist infinitely many p £ P such that 

n[A n T-( p+1 ^A n t- 2 ^a) > . 

It is easy to check that the set P + r is not even a set of 1-recurrence for 
r 6 Z \ {—1, 1}, by considering a periodic system with period r if r ^ and 
with period 2 if r = 0. A related topological version of this question was 
posed by Brown, Graham, and Landman [1]. 

A corollary of Furstenberg's correspondence principle is that any set of in- 
tegers with positive upper density contains arithmetic progressions of length 
3 with common difference in P — 1 (and also in P + 1). 

The higher order statement of Theorem ^was conjectured to be true in 
|14j . if it holds then the obvious generalizations of our proofs show that 
P — 1 and P + 1 are sets /c-recurrence for every k £ N. This and other 
generalizations are discussed in Sectional 

1.3. Convergence of averages along primes. A closely related question 
is the convergence of the associated multiple ergodic averages. In his proof 
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of Szemeredi's Theorem, Furstenberg proved more than the measure of the 
intersection in Q is positive. He showed that given a measure preserving 
system (X, X, fi, T) and A £ X of positive measure, we have 

(2) lim inf — v( An T ~ nA n^ 2n in...n T~ kn A) > . 

0<n<Ar 

A natural question is the existence of the limit of the associated ergodic 
average. More generally, given a measure preserving system (X, X , fj,,T), 
functions /i, /2, ■ • • j /fe € L°°(fi), and an increasing sequence of integers 
{s n } n gNi we can study the existence of the limit of 

_L Y, (T s -f 1 -T 2s -f 2 -...-T ks "f k ) 

0<n<N 

or, writing 5* = {s n : n > 1}, the limit of 

P) JsT^m E < T '/l • T^h ■■■■■ T"h) 

1 s€S,s<N 

in L 2 (fi) as N — > oo. 

In ^1 > it is shown that the limit exists for s n = n (another proof is given 
in |23|). If q{n) is an integer valued polynomial, the existence of the limit 
for {q(n) : n £ N} is shown in 12] (see |17j for a more general result). It was 
a folklore theorem that for k = 1, the limit exists for the primes; it follows 
easily from results on exponential sums due to Vinogradov and was written 
down explicitly by Wierdl [2^ . (Bourgain j3j and Wierdl [22] actually proved 
several stronger results on pointwise convergence along primes.) 

Let tt(N) denote the number of primes less than or equal to N. Using 
Theorem n we show: 

Theorem 3. Let (X, X,fi, T) be a measure preserving system and let j\, 
ji : X — > C be bounded measurable functions. Then the limit 

v ' p&¥,p<N 

exists in L 2 (fi). 

Moreover, we show that for k = 2 a certain factor, known as the Kronecker 
factor, controls the limiting behavior in L 2 ([i) of the corresponding multiple 
ergodic averages. Furthermore, in a totally ergodic system (meaning T and 
all its powers are ergodic), we show that the average over the primes has 
the same limit as the average over the set of positive integers. 

To prove multiple recurrence for the shifted primes and convergence re- 
sults for the primes, we compare the multiple ergodic average related to the 
primes with the standard multiple ergodic average over the full set of inte- 
gers. Using estimates in Lemma|31and the uniformity estimates of Green and 
Tao in Theorem we then show that the difference of some modification of 
the two averages tends to zero. Combining this with an ergodic version of 
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Roth's theorem, we prove double recurrence for the shifted primes in Sec- 
tion |3J In a similar manner, we approach the convergence questions for the 
double average over the primes. In Section [3 we prove double convergence 
and in Section H3 we find a characteristic factor for this average. 



2. Notation and Gowers norms 

By o N (1) we mean a quantity that depends on N, a\, . . . , a k 

and for fixed ai,...,dfc converges to zero as N — > oo; the convergence is 
assumed to be uniform with respect to all variables that are not included in 
the indices. 

Let 

A (n) = lp(n) • logn , 
where lp denotes the indicator function of the primes. We have 

(5) 1 £ (A(„)_A»)4 £ ^. logp 

0<n<N 0<p<VN 

_ logN ^ 1 2 



o<p<Vn 

When / is a function defined on a finite set A, let E (/(n) | n G A) 
E rag A/(n) denote the average value of / on A: 



E„, eA /(n) = ^ T ^/(n) , 



A\ 

1 1 n<=A 

where by | v4. | we mean the number of elements in A. We also use a higher 
dimensional version of the expectation. For example, by E mj „ gj 4/(n, m), we 
mean iteration of the one variable expectation: 

E meA (E neA f(n, m)) . 

Some of our addition and averages are taken in Z/iVZ and some in Z. In 
order to distinguish between the two senses, we use the expectation notation 
E ngZ / iVZ for an average in Z/NZ and Ylo<n<N f° r an average in Z. 

If /: Z/iVZ — > C, we inductively define: 

H/ll W iv Z ) = | E (/(«)heZ/iVZ)| 

and 



U d+1 {Z/NZ) = [ E (\\fh- f\\u d (Z/NZ) I h G Z / ArZ - 



l/2 d + 



where fh{ n ) = /(w + /i). Gowers showed that for d > 2 this defines a 
norm on Z/NZ. 
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3. Some lemmas 

For studying an average over the primes, it is convenient to replace this 
average with a certain weighted average over the integers. The next lemma 
enables us to do this. 

Lemma 1. // \a n \ < 1 for n€N, then 

An E a p~N E ( A («)-< 



ir(N) 



ON- 



>(1) 



p£P,p<7V 0<n<N 

where p(N) denotes the greatest prime number less than or equal to N. 

Proof. As noted in equation ©, we can replace A(n) by A'(n) making only 
a small error. Then we have 



i E E (A(n)-a n )|<l £ |(logiV-logp). 

V ' p£¥,p<N 



p£P,p<N 



in 



0<n<N 

^ E E A '(' 

p£P,p<-/V 0<n<7V 

Using equation (J5J again, we have that up to a small error term this last 
difference is equal to 



A E fcgtf-^ E A H 



peP,p<Af 



0<n<Ar 



By the prime number theorem and the well known fact that A has mean 
one, this difference goes to zero as N — > oo. □ 

The next Lemma is only used in the proof of Lemma 01 

Lemma 2. For k £ N, let 9, </>o, . . . , 4>k~i ■ Z/AZ — » C be functions with 
\4>i\ < 1 f or 1 < 2 < — 1 and let (j)Q be arbitrary. Then 

^m,nez/Nz(6(n) ■ 0o(m) • 0i(m + n) ■ . . . -(j)k-i{m + (k - l)n)) 



< 



\U k (Z/NZ) ' 1 1 W 1 1 L 2 (Z/NZ) ' 



w/iere ||0olljyi(z/jvz) = ( E nez/Nz\M n )\ 2 ) 1/2 ■ 
Proof. We make use of the identity 



E nez/ivza(n) = E n>fe62/7VZ (a(n + /i) • a(n)) , 
which holds for a: Z/AZ — ► C. We use induction in A;. For k = 1, 

E m ,„ e z/Arz(^(n) • 0o(m)) = E„ez/ivz • E meZ/7VZ o (m) 



< 



9 0|Il2(Z/7VZ) 
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Suppose the statement holds for k = I. We show that it also holds for 
k = I + 1. Applying Cauchy-Schwarz and the previous identity we have that 



E m ,„ 6 z/jvz(0(n) • (f>o{m) ■ 0i (m + n) ■ . . . ■ <f>i(m + In)) 



< E 



m&Z/NZ 

= E 



E nsZ/2Vz(0(») • M m + n) • . . . • 4>i(m + In)) ■ \\ 
m,n,h£Z/Nz(0(n + h) ■ 0(n) • $, h (m) • ^., ft (m + n) 



5 oIIl 2 (z/atz) 



<t>'i-x,h{ m + ( l - 1)")) • II0o|Il2 (z/jvz) , 



where ^ ft (m) = </>j+i (m + /i) • </>j+i (m) satisfies | <^ J < 1 for £ = 0, . . . , I — 1. 
The last average equals 

^hez/Nz(^m,nez/Nz(0(n + h) ■ 6{n) ■ 4>' Q)h {m) ■ (f)' hh (m + n) ■ . . . ■ 

0i-i,A(m+(l-l)n))) • 

< 1, the last 



By the induction hypothesis and the estimate 
average is bounded by 



J o,h 



L 2 (Z/NZ) 



E 



hez/NZ 



(n + h)- 6{n) 



Ui(Z/NZ) 



< IE 



■-heZ/NZ 



9{n + h)-9(n) 



Ui(Z/NZ) 



|2 

\U l+1 (Z/NZ) ■ 



1/2 1 



□ 



This completes the induction. 

We use [x] to denote the greatest integer less than or equal to x. 

Lemma 3. Let k > 2 and N > k be integers, and 8: Z/iVZ — > R 6e a 
function that is zero for [N/k] < n < N . Let (X, X,u,T) be a measure 
preserving system and fi, ■ ■ ■ , fk-i'- X — > C 6e measurable functions with 
H/illoo < 1 /ori = 1,... 1. T/ien 

' £ (0(n)'T n f 1 '...-T^ n f k . 1 ) <C k -\\0\ 



[N/k] 

1 ' 1 0<n<[N/k] 

for some constant Cj. > 0. 



U k (Z/NZ) 



Proof. Let /o € L°°(n) be arbitrary. We apply Lemma [21 for the functions 
x ,i(n) : Z/iVZ -» R defined by 



(t>x,o{n) 



fo(T n x) if < n < [N/k] 
if [N/k] <n<N 



and (p x ,i( n ) = fi(T n x) for £ = 1, . . . , k — 1. Note that for < m, n < N we 
have 

6(n) ■ 4>xfl{fn) ■ (f> x i(m + n) ■ . . . ■ ^ x ,k(m + (k- l)n) = 
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except when < m,n < [N/k], in which case the above expression is equal 
to 

9{n) ■ f (T m x) ■ h(T m+n x) f k ^{T m+ ^ n x) . 
Thus by Lemma |21 

E ■ M 7 ™ 1 ) ■ M Tm+nx ) ■ ■■■ fk-i(T m+{h - 1)n x)) 

0<m,n<[N/k] 

< P\\ umnz) ■ E l/o(^)l 2 ) 1/2 - 

0<n<[N/k] 

Integrating over X and applying Cauchy-Schwarz to the right hand side 
gives 

/ MX) " Wlk] E • ■■■■ h-i(T {k - l)n x)) d» 

J L / J <n<[AT/fc] 

/ N \3/2 

- \[N/k\) ' » e Wu h WNZ) • ll/ollzao*) • 
By duality, we have the advertised estimate with = (2k) 3 / 2 . □ 

4. Recurrence 

Theorem follows immediately from the next result, which we prove 
using Theorem ^ (the same statement holds for P + 1, with the obvious 
modifications). 

Theorem 4. Let (X, X, fj,, T) be a measure preserving system and A G X 
with fJ,(A) > 0. Then 

liminf— \- V / i(Anr( p - 1 )Anr 2 ( p - 1 »A)>o. 

TV^oo 7t(N) ^ ^ ' 

Proof. By Lemma Q it suffices to show that for prime numbers iV we have 
liminf — V (A(n+l)-^nr ,i Anr 2n yl))>0. 

0<n<N 

For this, it suffices to show that for some w£Nwe have 

(6) Ijminf^ £ (A w ^ Ntl (n) ■ fj,(A n T~ Wn A n T~ 2Wn A)) > , 

^ ' ^ 0<n<[N/3] 

where TV = ripeP,p<w^- We c l arm that 

(7) ^n^r E ((V,iW-i)'M^nrtnr*A)) 

^ ' J 0<n<[jV/3] 

— Ou>^oo(l) . 
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To see this, we first apply Cauchy-Schwarz to get 



^ £ ((A«,tf,i(n) - 1) • ^ n T Wn A n T 2 ^)) 



< 



[iV/3] 
1 



0<n<[7V/3] 



[iV/3] 



Y {(Aw,N,M - 1) • T Wn l A ■ T 2Wn l A ) 



L 2 M 



U 3 (Z/NZ) 



0<ne<[iV/3] 

By Lemma |21 this last term is bounded by 

(A W) jv,i(n) - 1) • 1[o,[jv/3]) 
and by Theorem ^ this is 

ON— >oo;u> 

(1) + O w -^oo(l) • 

Letting N — > oo gives (jTJ) . 

We now proceed to show (J5J. Let = 5. Roth's theorem easily implies 
(see Theorem 2.1 in |2] for details) that for every measure preserving system 
(Y,y, v, S) and B G y with u(B) > S there exists a constant c(<5) > such 
that 

(8) liminf 7 -?— V 1/(5 n 5^5 n S- 2n B) > c(8) . 

V ' N^oo N/3] ^ V V - V 7 

L ' J 0<n<[iV/3] 

Using (JTJ) and applying (JSJ) for the systems (X, A', T^, /i), we have 
hminf-i- Y, C A ^N,i(n) • M (A fl T~ Wn A n T~ 2 ^)) 

^ ' ^ 0<n<[A73] 

> hminf —L- ^ HAnr^nr 2H/ «A) + o^(i) 



jv-^oo [JV/3] 

Taking tu sufficiently large, we have @, completing the proof. 



0<n<[iV/3] 
>c(6) + O w ^oo(l) . 



□ 



5. I?- CONVERGENCE 
We now prove convergence of averages along the primes: 

Proof of Theorem^ We can assume that H/iH^ < 1 for i = 1,2. By 
Lemma Q it suffices to prove the corresponding results for the weighted 
averages 



(9) 



1 



0<n<N 
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For i£l let a x (n) = h{T n x) ■ f 2 (T 2n x). We claim that 
1 



(10) 



[WW/3] 



£ (A(n)-a x (n))- 



0<n<[WN/3] 



= On^c 

To prove (fTU|) first note that 

WW] ^ (A(n) • a x (n)) 
1 1 J 0<n<[(WN)/3] 

<f>(W) 

w 



^fy E [a^j E a ^ w 

^ V ' 0<r<W L ' J 0<n<[7V/3] 
(r,W)=l 



n + r. 



L2( M ) 



— y — r 

w\ ^ r AT/31 ^ 

0<n<[Af/3] 



m) ^ TJ/ [jv/3] 



0<r<VK 
(r,W)=l 



A(Ifn + r) • a x (Wn + r)) 



(1) 

=^F) E nvT^T E (^,JV,r(«) -a x (Wn + r)) + o N ^ ;w (l) , 

VV ; 0<r<W [ 1 J 0<n<[JV/3] 
(r,W)=l 

where the error terms are introduced because A is supported on the prime 
powers rather than the primes and the use of integer parts. Hence, 



1 



[WAT/3] 



(A(n) -a x {n)y 



0<n<[WN/3] 

^7) E E MWn + r)) 

Yy ' 0<r<W L 1 1 0<n<[7V/3]] 
(r,W)=l 



1 111 



0<r<W 
(r,W)=l 



L 2 M 

) - 1) ■ a x {Wn + r)) 



0<n<[N/3] 



L 2 M 



~ C ' 6iW\ E W^N/S) ■ (K,N,r( n ) ~ V, 
^ V ; 0<r<W 
(r,W)=l 



U 3 (Z/NZ) 



+ OJV^oo;w(1) 
+ OjV-»oo:iu(l) • 



The last inequality follows by applying Lemma El with S 1 = T w and ^ 
r ri / i5 % = 1,2. Now (dOI) follows using Theorem □ 
We proceed to show that the sequence 



A X (N) = E <n<7v(A(n) • a x (n)) 
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converges in L {pi) by showing that it is a Cauchy sequence. Let e > and 
B x ,w,r{N) = j^— Yl < Wn + r ) • 

[ ' J 0<n<[AT/3] 

Using (|1U|) and the fact that B xwr {N) converges in L 2 ([i) we get that for 
some Wo, if M,N are sufficiently large then 

A x ((W Q N)/3) - ( B ^oAN)) L2{fi) < e/3 , 



6{W) 

YV ' 0<r<W 



(r,W)=l 

\\B x ,w ,r{N) — B X}W0jr {M)\\ L2 , ■. < e/3 , 

for all < r < Wq with (r, Wo) = 1. Using this and the triangle inequality 
we have that if M, N are large enough then 

\\A x ((W N)/3) - A X ((W M)/3)\\ L2M < e . 

Since 

A x {{W N)/3 + i) = A x {{W N)/3) + ojv^oc(l) 
for < i < Wo/3, we conclude that A X {N) is Cauchy. □ 

6. A CHARACTERISTIC FACTOR FOR THE AVERAGE (@J) 

A factor of a measure preserving system (X, X, fi,T) is defined to be a 
T-invariant sub-cr-algebra Z of X. The factor Z is characteristic for L 2 - 
convergence of the averages in Q) if f\ and f% can be replaced by their 
conditional expectations E(/i | Z) and E(/2 | Z) without changing the value 
of the limit, taken in L 2 {p). The Kronecker factor K, is defined to be the 
smallest sub-cr-algebra of X with respect to which the eigenfunctions of T 
are measurable. 

Theorem 5. Let {X, X, T) be an ergodic measure preserving system and 
let fx, /2 : X — > C be bounded measurable functions. Then the Kronecker 
factor K, is characteristic for L 2 {pi) convergence of the average in @. Fur- 
thermore, if {X, X,fjL, T) is a totally ergodic system, then the limit of the 
average in @ is equal to 

l im J_ V" • T 2n f 2 ) . 

0<n<N 

Proof. We first show that the Kronecker factor is characteristic. It suffices 
to show that if either f\ or f 2 is orthogonal to the Kronecker factor then the 
average in Q converges to zero in L 2 {/i). By a x (N), A X {N) and B X:W>r (N) 
we denote the sequences defined in the proof of Theorem |31 As it was shown 
in jSj, for every w and r the sequence B XjW:r (N) converges to zero in L 2 (/x) 
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as N — > oo. Since liniTv^oo A X (N) exists, by (fTU|) we have that 
lim A X (N) = lim A x (WN/3) 

= lim B XiW:r (N) + o t0 _» DO (l) = o to _ i . 00 (l) , 

N— >oo 

where all the limits are taken in L 2 (/i). The result follows by letting w — > oo. 

Next, we evaluate the limit for totally ergodic systems. For any such 
system we have for every w, r £ N and /i, /2 £ L°°(fi) that 

( H ) J im ~\7 E T Wn+r f 1 .T 2Wn+r f 2 = lim 1 ^ T n h-T 2n h, 

N^oo I\ * — * N^oo iv — 

0<n<N 0<n<iV 

where both limits are taken in L 2 (fj,). One can see this using the formula 
for the limit of these averages given in [Hj. Equation (jllj) gives 

lim B XjW>r (N) = lim — V 0a .(JV) 

0<n<N 

for every u;,r G N. Since A^iV) converges in L 2 (n) we conclude from (|lUj) 
that for every u> 

lim ^(JV)= lim A K ((WiV)/3) = lim - V a x {n) + o^^l) , 

N^oo N^oo N^oo iV 

0<n<N 

where all the limits are taken in L 2 (/u). The result follows by letting w — ► 
oo. □ 

7. Further generalizations 

It is natural to ask about higher order recurrence for the shifted primes 
and higher order convergence along the primes. The major missing ingredi- 
ent is the higher order statements of Theorem ^ with respect to the Gowers 
norms. If such estimates hold, then again using a uniform version of the mul- 
tiple recurrence theorem of Furstenberg (contained in [2]), our proof carries 
over. For convergence, the proof also carries over, using the general result 
on convergence of linear averages in ^Hj. Furthermore, the generalization of 
Theorem El holds, using the description of the characteristic factors in JH] 
and the higher analog of identity in jHj . 

With small modifications the argument used to prove Theorems and 01 
also gives analogous recurrence and convergence results for two commuting 
transformations. (The proofs of these are almost identical to the proofs 
given and so we omit them.) Namely, if T\ and T 2 are commuting invertible 
measure preserving transformations of a probability space (X, X, fi) and 
A £ X with fJ,(A) > 0, then there exists pGP such that 

^(4nr- (p - 1) inr 2 - (!, - 1) i)>o. 

The analogous statement holds with p + 1 instead of p — 1. Replacing 
the role of the uniform version of Furstenberg's recurrence theorem is a 
uniform version of the multidimensional Szemeredi Theorem of Furstenberg 
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and Katznelson If one can obtain an estimate for the fc-th Gowers 

norms of A — 1 for k > 4, analogous to the one of Theorem ^ then as 
for a single transformation, one would obtain the higher order commuting 
version. 

Similarly, one can show that if /i, /2 £ L°°(fx), then the averages 

converge in L 2 (n) as iV — » oo. Again, we compare this average with the 
standard ergodic average for two commuting transformations, whose con- 
vergence was proven in 5 . Since convergence is not known for the standard 
average of k > 3 commuting transformations, even with higher order esti- 
mates on the Gowers norm of A — 1, our proof would not generalize. 
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